Small Sample Statistics for Classi cation Error Rates

نویسنده

  • J. Kent Martin
چکیده

Several techniques for estimating the reliability of estimated error rates and for estimating the signicance of observed dierences in error rates are explored in this paper. Textbook formulas which assume a large test set, i.e., a normal distribution, are commonly used to approximate the condence limits of error rates or as an approximate signicance test for comparing error rates. Expressions for determining more exact limits and signicance levels for small samples are given here, and criteria are also given for determining when these more exact methods should be used. The assumed normal distribution gives a poor approximation to the condence interval in most cases, but is usually useful for signicance tests when the proper mean and variance expressions are used. A commonly used 62 signicance test uses an improper expression for , which is too low and leads to a high likelihood of Type I errors. Common machine learning methods for estimating signicance from observations on a single sample may be unreliable.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bias in land cover change estimates due to misregistration

Land cover change may be overestimated due to positional error in multi-temporal images. To assess the potential magnitude of this bias, we introduced random positional error to identical classiŽ ed images and then subtracted them. False land cover change ranged from less than 5% for a 5-class AVHRR classiŽ cation, to more than 33% for a 20-class Landsat TM classiŽ cation. The potential for fal...

متن کامل

Double-bagging: combining classifiers by bootstrap aggregation

The combination of classi"ers leads to substantial reduction of misclassi"cation error in a wide range of applications and benchmark problems. We suggest using an out-of-bag sample for combining di0erent classi"ers. In our setup, a linear discriminant analysis is performed using the observations in the out-of-bag sample, and the corresponding discriminant variables computed for the observations...

متن کامل

Small Sample Statistics for Classi cation Error Rates I : Error Rate Measurements

Several methods (independent subsamples, leave-one-out, cross-validation, and bootstrapping) have been proposed for estimating the error rates of classiers. The rationale behind the various estimators and the causes of the sometimes con BLOCKINicting claims regarding their bias and precision are explored in this paper. The biases and variances of each of the estimators are examined empirically....

متن کامل

Clustering financial time series: an application to mutual funds style analysis

Classi#cation can be useful in giving a synthetic and informative description of contexts characterized by high degrees of complexity. Di3erent approaches could be adopted to tackle the classi#cation problem: statistical tools may contribute to increase the degree of con#dence in the classi#cation scheme. A classi#cation algorithm for mutual funds style analysis is proposed, which combines di3e...

متن کامل

The Error-reject Tradeoff

We investigate the error versus reject tradeo for classi ers. Our analysis is motivated by the remarkable similarity in error-reject tradeo curves for widely di ering algorithms classifying handwritten characters. We present the data in a new scaled version that makes this universal character particularly evident. Based on Chow's theory of the error-reject tradeo and its underlying Bayesian ana...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996